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ON  THE  SIRICT  DETERMINATION  OF  HYPOTHESIS  TESTING  GAMES^ 

by 

Richard  Clark  Persons 

Department  of  Mathematics 
Massachusetts  Institute  of  Technology 

Section  0.  A Rough  Statement  of  the  Question 

Consider  the  following  game  for  two  players.  Player  I picks  some 
measure  Pg  from  the  collection  {P^:  0e[O,l]}  . Each  Pg  is  a 
probability  measure  on  [0,1]  equipped  with  the  usual  Borel  sigma-field. 

A point  X is  then  selected  at  random  via  Pq  » the  measure  chosen  by 

0 1 1 

Player  I.  Player  II  must  then  guess  whether  0qC[O,^]  or  • 

If  he  is  right  he  wins;  if  not  he  loses. 

A strategy  for  Flayer  I is  a measure  v on  0 = [0,1]  , the  set 
of  all  possible  values  of  0 . A strategy  for  Player  II  is  a function 

c|):  [0,1]  -*■  {0,1}  , where  <>(x)  = 0 indicates  the  guess  that 

Ooe[0,^]  , and  0 x)  = 1 indicates  the  guess  of  0q£  l|-,l]  . , 

i 

The  question  to  be  examined  in  this  paper  is;  if  for  each  Player  I ^ 

strategy,  v , there  is  a Player  II  strategy  that  will  win  with  i 

probability  one,  is  'here  a single  strategy  <{)  that  will  win  with 

probability  one  for  any  strategy  v of  Player  I?  i 

This  question  h<  3 an  application  to  the  issue  of  Bayesian  statistics  | 
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versus  classical  statistics.  Consider  Player  I to  be  nature  and  Player  II 
to  be  the  statistician.  If  the  omniscient  Bayesian  statistician  (always 
knowing  nature's  prior  v),  can  always  make  the  right  de:ision  in  a 
certain  situation,  can  the  classical  statistician  always  iiake  the  right 
decision  in  the  same  situation? 

The  above  is  a special  case  of  a more  general  question:  Does  every 

Hypothesis  Testing  Game  have  a value? 

Consider  the  game  as  aoove.  Player  1 selects  0 , then  a point 
X is  selected  according  to  P^  . Player  II  sees  x and  guesses  whether 
9 c [0,^]  or  9 e (]rJ  ] , winning  1 if  correct  and  0 if  not.  A 
strategy  of  Player  I is  a probability  measure  v on  [0,’],  the  set  of 
all  possible  values  of  6 . He  selects  6 via  the  niea  ure  v . 

A strategy  for  Player  II  is  a function  0 mapping  [0,1]  , the  set 
of  all  possible  values  of  x , to  [0,1]  . We  let  (I)(x)  be  the 
probability  of  guessing  that  9 c [0,^]  , when  x is  the  observed  value. 

Define  the  expected  payoff  of  Player  II  as 


M(v,<J.)  = ffl  , (9)<f(x)P.(dx)v(d0) 
[0,1]  ® 


(0)(l-4)(x))Po(dx)v(d 


The  upper  value  o'''  the  game  is  defined  as 

U = inf^  sup^  M(v,(f)  . 
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The  lower  value  L is 


L = sup^  inf^  M(v,4))  . 

A game  is  said  to  have  a value  if  U equals  L . 

What  is  considered  in  this  paper  is  the  case  for  which  U is  one. 

We  ask  the  question  whether  this  implies  that  L is  one  also. 

Section  1.  Introduction  and  Summary 

From  this  point  on,  unless  stated  otherwise,  we  will  ass  Tiie  that 
{Pg,  0e[O,1]}  is  a collection  of  countably  additive  probability  measures 
on  the  unit  interval,  equipped  with  the  usual  Borel  sigma-field.  Also 
needed  is  a measurability  condition  on  {Pg}  as  a function  of  0 . 

For  every  Borel  subset  A of  [0,1]  , assume  that  P0{A)  is  a Borel 
function  of  0 with  respect  to  the  usual  Borel  si()ma-field  on  [0,1]  . 
This  measurability  condition  will  be  called  assumption  M . 

For  ease  of  notation,  we  make  the  following  definitions: 

0 = [0,1]  , 0^  = [0,^]  , 0^  = {pi] 

6g  = the  usual  Borel  sigma-field  on  0 , 

X = [0,1]  , 8^  = the  usual  Borel  sigma-field  on  X , 

Pi  = {Pq  : 0 ^ , ?2  = {Pq  : eeG^}  . ' 

Of  course,  X could  be  any  complete  separable  metric  space,  and 


m 


similarly  for  0 . The  sets  0-|  and  O2  could  be  arbitrary  disjoint 
Borel  subsets  of  0 . 

The  families  P-|  and  P2  can  be  orthogonal  in  three  ways:  pair- 
wise, weak,  and  strong.  Pairwise  orthogonality  of  P-|  and  will 

mean  that  every  member  of  P-j  is  orthogonal  to  every  mtmber  of  P2  • 

Weak  orthogonality  will  mean  that  for  every  probability  neasure  v-j  on 

0-|  and  V2  on  O2  the  measures  P^v-jlde)  and  Pf,\2(d")  are 
orthogonal.  Strong  orthogonality  will  mean  that  there  e>ists  a Borel  set 
B which  supports  every  elenient  of  P^  and  B supports  every  element 
of  P^  . 

The  three  types  of  orthogonality  will  be  denoted  by  "P^  , (pr)P2"  , 

"P-|  ^ (w)P2"  , and  "P-|  I (s)P2"  , respectively. 

Now  that  the  notation  has  been  introduced,  let  us  mt-ntion  why  the 
measurability  Assumption  M is  needed.  Without  this  as  umption,  it 
is  easy  to  find  families  P-j  and  P2  that  are  "weakly"  separated 
but  not  strongly  separated.  What  follows  is  such  an  exanple. 

Let  A be  some  analytic  subset  of  X that  is  not  I orel . (An 
analytic  set  is  the  forward  image  of  a Borel  set  via  a Borel  measurable 
function.  See,  for  example,  Kuratowski,  Volume  I,  (1966),  or  Parthasarathy 
(1967).)  Parameterize  (P  } in  which  a way  that  P-|  = {6^:  x e A) 
and  P2  = (6^:  xcA^}  . As  is  shown  in  Section  2,  is  a legitimate 

family  under  the  measurability  assumption,  though  is  not. 

Analytic  sets  and  complementary  analytic  sets  are  measurable  with 
respect  to  the  completion  of  any  given  Borel  measure.  (Pefer  to  Kuratowski, 
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Section  39.)  The  obvious  definition  of  an  average  of  elements  of  P.| 
is  any  measure  whose  completion  assigns  measure  one  to  the  set  A . A 
similar  definition  would  hold  for  averages  of  elements  of  . Let 

be  any  average  of  P.j -measures  and  let  be  any  average  of 

P^-measures.  Since  the  completion  of  gives  A measure  one,  there 
is  a Borel  set  c A such  that  Q^(B^)  = 1 , and  similarly  there  is 
a Borel  set  B^c  A*'  such  that  ~ ^ follows  that 

Q.J  i giving  P^  ^{v))P^  . However  P.j  and  P^  are  not  strongly 
separated  by  Borel  sets  since  A is  not  a Borel  set. 

Assumption  M could  be  weakened  in  one  way  suggested  by  the  following 

example.  Let  P.j  = {r(0)6Q  + (1 -r(0) ^ ^ 

= {r(0)62^3  + (1  -c(e)  )6.|  : 6 c where  r : G (0,1 ) is  any 

function.  If  r is  a non-measurabl e function  then  Assumption  M is 
not  satisfied  even  tncugh  the  classes  P.|  and  P^  are  easily 
separated  by  a Borel  set.  If  P^  , P^  are  any  two  classes  of 
measures  gener.ited  under  Assumption  M then  any^  deformation  of  the  proba- 
bilities (P0vB)}  for  B c B^  that  leaves  0 and  1 probabilities 
invariant  will  not  a feet  strong  separation. 

Now  consider  thf  various  definitions  of  orthogonality  of  two  classes 

of  measures.  It  is  clear  that  strong  orthogonality  implies  weak  ortho- 
gonality which  implies  pairwise  orthogonality.  Pairwise  orthogonality 
does  not  imply  weak  orthogonality  as  is  illustrated  by  example.  Let 
= {620=  0 

on  [0,1]  . Clearly, 


and  let  P^  2 {>,}  where  X is  Lebesgue  measure 
P^  1 (pr)P2  , yet  ^ ~ J average  of 
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elements  of  , is  Lebesgue  measure  on  [0,1],  giving  fiat  P-|  and 
P^  are  not  weakly  orthogonal . 

Under  Assumption  M,  we  have  no  examples  of  weakly  s 'parated  families 
that  are  not  strongly  separated.  Some  of  the  results  tha'  are  oroven  are 
stated  below,  and  their  proofs  are  contained  in  the  folio  /ing  sections. 

If  P|  and  p2  contain  discrete  measures  only,  :hen  under 

Assumption  M,  pairwise  orth  igonal i ty  implies  strong  orthcqonal i ty.  this 
is  proved  in  Section  1 and  aection  3. 

If  P-|  and  P2  consist  of  measures  that  are  don  inated  by  a 

single  measure,  then  pairwi  .e  orthogonality  implies  strorn  orthogonality. 
Assumption  M is  not  needed  here,  in  fact,  no  measurability  condition  is 
needed  at  all.  These  facts  are  demonstrated  in  Section  4 

Let  P-|  and  P^  consist  of  measures  having  a di  >crete  part,  and 
a continuous  part  which  is  dominated  by  a single  sigma-finite  neasure. 

Then  under  Assumption  M,  weak  orthogonality  implies  strotig  orthogonality. 
Ihis  is  proved  in  Sections  S and  6. 

* Le£  0^  j he  the  distribution  of  the  rat  iom  variable 

^ where  {Y|^  > are  independent  Bernoulli  randon:  variables 

for  which  P{Y.  =1}  = y . Call  S = {Q  : yo[0,1]}  the  coin  tossing 

K y 

family  of  measures.  The  family  5 is  an  undominated  fan  ily  of  continuous 
measures  on  [0,1]  . In  Section  7 it  is  shown  that  if  , P^C  S then 
under  Assumption  M,  pairwise  separation  of  P^  with  P^  is  equivalent 
to  strong  separation. 
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Section  2.  Ihe  Most  Basic  Case 

Let  us  first  examine  the  case  for  which  x , and 

^7  = <^3(0)  ’ measure  Pq  is  just  the  point  mass  measure 

at  some  point  a(0)  . 

Lemmd_2_^l . Under  Assumption  M,  the  function  a:  0-^X  is  a Borel 
Tieasurabl  e f_ujL'-t^ion_  of_  6 . 

Proof:  Trivia  l since  (6:  a(0)cB}  = {G:  Pq(B)  = 1)  and  if 

3 second  snt  is  a Borel  set  of  O's  by  Assumption  M . □ 

^i_Uion_2.  . Under  Assumption  M vf  P^  1 (pnlP^  an^ 

, = 5 , . , then  tt ere  exists  a Borel  subset  B of  [0,1]  , such 
0 av  ) — 

that  P (B)  = 1 if  0 cO^  ^ P^(B)  = 0 i_f  '' ®2  ’ 

Proof:  By  Lemra  2.1,  a(0)  is  a measurable  function,  so  that 
3 -|)  and  a(02)  aie  analytic  sets.  The  sets  a(0-j)  and  a(02)  are 
disjoint  since  the  c asses  P-j  and  ?2  are  sepirated  by  pairs.  By 

the  analytic  set  sep  ration  theorem  (Kuratowski,  Section  39),  there  is 
a Borel  set  B that  contains  a(0-|)  and  whose  complement  contains 
a(0p)  . We  will  the  have  that  P.  (B)  = 1 or  0 , according  as 
0 c 0-|  or  0 G 2 • ^ 


Section  ^ _ lJie_Gene>  ^ Discrete  Case 

In  this  sectio'  , we  will  treat  the  case  in  wiich,  for  all  6 , 

Pf.  is  a purely  dis  rete  measure.  What  is  to  be  proved  is  that,  under 


Assumption  M,  if  P and  P are  separated  by  pairs  then  they  are 
separated  stro  igly. 
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In  Section  2,  each  measure  was  a point  mass  a a(‘'/  , which 

was  shown  to  be  a Bore!  fun  tion.  A similar  argument  wi ! work  in  this 
case.  Rather  than  the  one  function  a(0)  , a countable  r jinber  jf  functions 
^ce  reeded.  For  etch  n,  k,  f (e)  will  be  cie  of  the  atoms 

of  Pp  . (For  some  values  of  n,  k and  0 , f^  l^(P)  might  oe  a special 
value  to  indiiate  that  f^  (n)  is  not  an  atom  of  P^^  The  reason  for 
this  will  become  clear  beloa.] 

The  functions  f^  are  defined  as  follows.  Cot  .ider 

={xc[0.1]:P^ix}2l^'-  If  A^  ha  sat  least  k t Ic-ments  then 

define  f .(f)  to  be  the  k^’'  largest  element  in  A*^  . If  A*  does 

not  have  at  least  k elements,  then  define  f , (0)  to  -e  mitiis  one, 

n ^ K 

the  special  element.  The  fmction  l^(  ) is  then  the  k^'^  lamest 
atom  of  Pq  having  probability  greater  than  or  equal  t n i . (Largest 

is  in  the  sense  of  the  ordering  on  X = [0,1]  , not  the  nost  probable.) 

As  defined,  f^  1^  is  then  a Borel  function  by 

Lemma  3.1.  Under  Ass  u, apt  ion  M,  for_^ch  n = 1,2,...  and 
k = l,2,...,n.  f^  1^:  < *■  [0,l]u{-lj  is  a Bore!  mea^]^.  bl^ fun£t_i^n , 

People  As  is  usual,  it  is  enough  to  show  for  eac!  fixed  xc  [0,1]  , 

that  the  set  = {»;  f^  |^(G)  xl  is  a Borel  subse'  of  ( . Let 

n,  k,  X be  fixed.  Define  ^he  sets  of  9 : 

'^S,m  = P,{hVI''’',(J.n)/?"')n[x,l]}  > 1 

defined  for  m = 1,2,...  ar  i *.  = 0,1  , . . . ,2*^  , 
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i > 


[ 


I 

i 

! 

I 

I 


= {0:  there  exist  £(1 ) ,« (2) , . . . , £(k  j such  that 

0<£U)  £(2)<. . .<£(k)''  2"^  and  that  0-  bJ,.,  1 

defined  for  m = 1,2,  . . , 

n k n k 

B ’ = {):  there  exists  M > 1 such  that  hen  ,,  B ’ } . 

- m=M  m 

The  sets  B'^  are  Bore!  sets  by  Assumption  M.  The  sets  B*^’^  , 

£,m  m 

will  also  be  Borel  se‘s  since  they  are  obtained  through  countable  inter- 
sections and  unions  o*  sets  b"?  . We  clain  that  = B^’*^  . This 

1 £,rn 

is  true  because  P {y  > “ small  intervals  I containing 

0 — n y 

y have  probability  o'"  at  least  ]-  . The  set  is  the  set  of  0 for 

n m 

which  at  least  k intervals  of  the  form  [£/2'^, ( £+1  )/2'^)  have  P„ 

u 

probability  at  least  ^ . Clearly,  in  order  that  f^  1^(9)  ^ x it  is 

n k 

necessary  and  sjfficir*nt  that  B ’ occur  all  but  finitely  often.  Thus 

m 

^n,k  _ gn,k  clained,  proving  that  for  all  n,  k,  f (•)  is  a Borel 

n ^ K 

measurable  function.  Q 

Remark:  The  pr  lof  of  the  lemma  nowhere  uses  the  fact  that  the 

P 's  are  purely  discrete;  only  Assumption  M is  used.  We  will  make  use 

of  this  fact  in  Section  6. 

We  are  now  in  a oosition  to  prove  the  following. 
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1 heorem  3.1.  If  for  a1 1 0,  Pq  is  a purely  disc  ete  measure. 

Assumption  M holds,  and  Pj  I (pr)P2  then^  1 (s)P2 

Praqf:  Since  f^  is  a measurable  function,  the  ^ets 

fn  analytic  sets  by  the  definition  of  analytic 

set.  Let  A,  . f„,k('  |)v-l-  and  = u„  ^ -U  . 

The  sets  A-j  and  A^  are  then  analytic  since  the  countable  union  of 
analytic  sets  is  an  analytic  set.  (See  Kuratowski.)  A.j  , A^  mus^  also 
be  disjoint,  since  P.|  i \pr)p^  , forcing  P.| , P^  to  have  no 
atoms  in  common.  We  now  can  apply  the  analytic  set  separition  theorem 
as  in  Section  2 to  get  a Borel  set  B that  contains  and  whose 

complement  contains  . The  set  A^  contains  all  atom,  of  measures 
in  P,  , since  if  P„{xl  ■ 0 then  P„‘x}  > — . for  some  n'  giving 

f I 1.(0)  X for  some  k'  n'  . We  then  have  that  Po(B)  = 1 for 

e e 0.J  and  Pq(B)  = 0 for  0 r.  . □ 

Remark:  There  is  a slightly  cleaner  proof  of  this  theorem  not 

using  Lemma  3.1  --  however  the  proof  given  above  lends  itself  directly 

to  Section  5 below,  whereas  the  other  proof  does  not.  Th.^  alternate 

★ 

proof  is  this:  The  set  A = •'  (x,e):  Pp(xl>0,  0 c 9.| } is  a Borel 

★ ★ 

set  in  the  plane.  The  set,  ttj^(A  ),  the  projection  of  A onto 
X = [0,1],  is  then  an  anal /tic  set.  As  above  , apply  the  anal/tic 
set  separation  theorem  to  yield  the  desired  result. 
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Section  4.  The  Case  of  Dominated  Measures 

In  this  section,  we  will  deal  with  the  situation  in  which  the 
collection  {P^,}  is  dominated  by  some  sigma-finite  measure  a,  that 
is,  <<  X for  all  0 . 

Let  .'1^,  be  two  collections  of  sigma-finite  measures  on  a 
measurable  space  X . We  say  that  and  are  equivalent  if: 

iiii(E)  = 0 for  all  m^  in  implies  m^CE)  = 0 for  all  in^  c > 

and  conversely.  The  following  lemma  of  Halmos  and  Savage  (1949)  allows 
us  to  treat  the  dominated  case. 

I.emma  4.1.  Every  dominated  set  of  measures  has  an  equivalent 
countable  subset. 

Proof:  Refer  to  Lemma  7 of  Halmos  and  Savage  . □ 

The  following  easy  result  is  also  useful. 

Lerana  4.2.  Let  P,  R be  two  countable  sets  of  sigma-finite 
measures  on  some  measurable  space,  then  if  P and  R are  separated 
by  pairs,  they  are  al so  separated  strongly. 

Proof:  Say  that  P = {P.j,P2,...)  and  R = {R.j,R2,...)  . For 

each  i,  j,  P^  i Rj  by  assumption,  so  there  exists  a Borel  set 
such  that  P^(B^j)  = 0 and  ” 0.  If  we  define  the  Borel  set 

B to  be  ^.j  T’lj  then  for  all  i and  j , P.(B^)  = 0 and 

Rj(B)  = 0 . 


□ 
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We  can  now  tie  these  two  leinnas  together  to  get 

Theorem  4.1.  Let  P-j  1 {pr)?^  and  let  there  be  a sigma-finite 
measure  X such  that  Pg  <<  X for  a1 1 ne[0,l],  Uien  i (s)p2  . 

Proof:  By  Lemma  4.1,  and  P^  each  has  a countable  equiva- 

lent subset.  Now,  use  Lenina  4.2  on  the  countable  equival  .-nt  subsets  of 
and  P^  to  obtain  the  desired  result.  □ 

Rem^>^k:  Assumption  M was  not  required  at  all;  onl>  the  pairwise 

separation  of  the  two  classes  was  needed.  Assuming  a cla^s  to  be 
dominated  is  a very  strong  assumption. 

Up  to  this  point  we  have  not  needed  the  assumption  that  the  classes 
P.|  and  be  weakly  separately,  just  pairwise  separation  was  used. 

In  the  following  section  we  will  require  the  weak  separation  assumption. 

Section  5. An  Arbitrary  Measure  Versus  a Class  of  Discre te  ]^_asjjres . 

What  has  been  treated  so  far  are  the  cases  in  which  either  all 
measures  are  discrete,  or  all  measures  are  absolutely  continuous  with 
respect  to  a single  measure  (such  as  Lebesgue  measure).  In  this 
section  we  shall  examine  the  situation  of  one  continuous  measure  in 
contrast  to  a collection  of  purely  discrete  measures. 

Precisely  what  is  assumed  is  that  P.j  - {Q1  for  some  probability 
Q , and  P^  consists  of  ourely  discrete  measures.  As  we  know  from 

Section  3,  A = {atoms  of  Measures  in  P^ ‘ is  an  analytic  set.  While 
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the  set  A need  not  ’)e  a Bore!  set,  it  is  measurable  with  respect  to  Q , 
more  precisely  with  respect  to  the  completion  of  the  measure  Q . 

The  completion  of  a measure  p will  be  denoted  as  p . The 
quantity  Q(A)  is  then  well  defined.  We  are  led  to  the  following 
proposition. 

Proposition  5.1.  K 

(1)  Assumption  M holds , 

(2)  P.J  iQl,  •'’2  are  all  purely  discrete. 

^3)  Q' P2  atoms  ; = 0 

then  the  f ami  1 i es  1 j and  P are  strongly  separated. 

Proof:  Let  A = i atoms  of  measures  in  P2}  . The  set  A is 

analytic  so  there  exist  Borel  sets  ,B2  such  that  B.j  C A c B2 

and  Q(B2)  = Q(B.j)  . But  Q(B2)  = 0 by  assumption,  giving  that 
P (B2)  = 0 for  L ^ , and  P (Bp)  = 1 for  p c 0p  . (That  is, 

and  Pp  are  strongly  separated  families  of  probability  measures). 

□ 

Wiidc  remains  is  the  case  in  which  Q(A)  > 0 . If  Q(A)  > 0 then 
it  is  not  possible  that  the  families  P.j  and  Pp  can  be  weakly 

separated  as  will  be  shown  below.  This  is  accomplished  by  finding  an 
appropriate  probability  measure  v on  0p  . The  measure  u =/p0v(de) 

will  be  an  average  of  measures  in  Pp  yet  not  orthogonal  to  the  -Measure 

Q , giving  a contraciction  of  weak  separation. 
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In  order  to  find  the  desired  measure  , the  first  step  is  to  find 
a "nice"  subset  of  the  set  of  atoms.  This  is  facilitated  by  the  following 

Lemma  5.1.  Under  Assumption  M,  rf  Q{ atoms)  > 0 then  there 
exist  n'  , k'  such  that  k'^  ’2^  ''  ® ‘ 

Proof:  Follows  direi  tly  from  the  proof  of  Theorem  3.1  and  the 

measurability  of  analytic  sets.  fj 

From  this  point  on,  assume  that  some  n'  , k'  have  .een  chosen 
so  Qif  , . ,(©p)}  > 0 . Denote  the  function  f i i, . by  f . 

A heuristic  argument  of  what  is  to  be  attempted  follows.  What 
we  would  like  to  do  is  to  construct  v from  Q and  f in  a way 
that  would  give  v(f  ^(-))  = Q(’)  • This  would  identify  null  sets  of 
Q with  those  of  v . If  u(-)  were  defined  as  )v{de)  then 

null  sets  of  Q would  have  u-probabi 1 i ty  less  than  one.  This  would 

be  true  since  for  N a Q-riull  set,  v assigns  probability  less 
than  one  to  N . This  argument  will  be  rigorized  below. 

However,  this  approach  cannot  be  used  as  it  stands  since  \ is 
only  defined  on  a sub  sigma-field  of  the  Borel  sigma-field,  namely 
that  generated  by  the  measurable  function  f . What  follows  basically 
is  a means  of  extending  the  definition  of  v to  the  entire  Borel 
sigma-field. 


j 


I 
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The  paper  “Borel  Structure  in  Groups  and  Their  Duals"  by  George 
Mackey  (1957)  contains  the  needed  results.  He  defines  a Borel  space  to 
be  standard  whenever  it  is  Borel  isomorphic  to  the  Borel  space  asso- 
ciated with  a Eorel  subset  of  a complete  separable  metric  space.  A 
sigma-finite  measure  X on  a Borel  space  S is  standard  if  there 
exists  a Borel  subset  E of  S so  that  E regarded  as  a subspace 

of  S is  standard  and  that  X(S\E)  = 0 . In  this  paper,  all  Borel 

spaces  and  all  measures  used  are  standard,  by  definition. 

The  theorem  belcw  is  a direct  statement  of  Mackey's  Theorem  6.3  in  the 
paper  mentioned  above. 

Theorem  5.1.  l^t  S.j  ^n^  be  Borel  spaces  and  let  be  stanard. 

Let  p be  a standard  measure  in  S.|  . Let  A be  a Borel  subset  of 

S-j  $2  such  that  for  each  x c S.|  there  exists  y c so  that 
(x,y)  e A . Then  there  exists  a Borel  subset  N ojf  S.|  and  a Borel 
function  ^ ffoni  S.j\N  ^ such  that  (x,([)(x))  c A for  all 
X e S^N  ^nd  ; (N)  = 0 . 

This  theorem  and  some  of  its  ramifications  is  treated  in  detail  in 
the  monograph  of  T . Parthasarathy , (1971 ) , Selection  Theorems  and  Their 
'IPPjJ^^tjon^. 

We  can  apply  th«  theorem  in  the  following  way.  The  graph  G = (f(o),0) 
of  the  function  f is  a Borel  set  in  the  product  space  X x ©2  . Then 

f(  )2)  is  m^(G)  for  the  projection  map  onto  X . By  the  assumptions 

★ 

above,  there  exists  a Borel  set  B contained  in  f(02)  such  that 
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Q(B  ) > 0 . The  set  (B  ) will  be  a Bore!  set  in  the  ur 'duct 

space  which  will  serve  as  the  set  A of  the  theorem  above. 

The  application  of  the  theorem  is  stated  as 

Lemma  5.2.  For  f and  Q defined  as  in  this  section,  there  exists 
a Borel  subset  B'  of  f(0.,)  wjth  Q(B'}  ' 0 , and_^_B  >rej_  ni'.'asurable 
function  mapping  B'  - ^ l^^t,  f o r_  aJJ  x > B'  , 

(x,.t)(x) ) £ {(f  (e)  ,0) : e £ 0,,}  . 

- 1 * 

Proof : Let  A = GO  ^ (g  ) which  is  a Borel  subset  o<"  X ' . 

★ 

The  set  B is  a subset  of  f(02)  so  that  maps  A to  all  of 

■At  ★ 

B . The  set  B can  serve  as  the  Borel  space  S.|  of  Theorem  5.1. 


The  set 


serves  as  . The  theorem  asserts  the  existence  of  a 


Q-null  set  N and  a Borel  'unction  (ji  mapping  B\N  to  such 

that  (x,:(x))  t G for  all  x » B \N  . Then  the  set  B'  = B*\N  is 
the  desired  set  and  the  fun  tion  ! is  the  desired  function.  □ 


Remark : The  construction  of  (P  gives  us  that 


IS  a one-one 


function.  As  is  demonstrated  below  (p  is  a one  sided  inverse  of  the 
function  f . 


Define  the  measures 


on 


and  u on  X as 


(5.1) 


" qTF7 


and 


(5.2) 


m(-)  = /p„(-)x>(de) 


I 
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ihat  p cannot  be  orthogonal  to  Q is  demons trated  below  in  the  proof  of 

Proposition  5.2.  If 

( 1 ) Assumption  M holds , 

(2)  £ Q}  , are  all  purely  discrete, 

(3)  Qi  p2  atoms  I > 0 

then  P.|  and  P^  cannot  be  weakly  separated. 

Proof:  ;_et  , , v be  defined  as  in  (5.1)  and  (5.2).  We  want  to 

show  that  C c 8^^  and  Q(C)  = 1 implies  p(C)  0 , which  would 

demonstrate  that  p is  not  orthogonal  to  Q . First  recall  that  f 
is  f^,  1^1  for  some  n',  k'  . The  function  f has  as  its  image,  the 

point  i-1  and  atoms  of  P.  whose  P„  probability  is  at  least  — , . 

no  n 

We  then  get  that  for  any  C c 8 (denoting  A-  as  e ) 

A n 

(5.3)  Pg(C)  > cl(.(f(e))  . 

(The  element  -1  t X , and  C e 6^  so  -1  is  not  an  element  of  C.) 

So  we  have 

(5.4)  u(C)  > Ev(e:  f(e)eC}  = Q^  Q(B'n.j)'^  {(  : f(0)eC))  . 

Now  note,  if  y B'l  C then,  since  B'  C f(02)  there  is  at  least  one 

t ©2  for  which  f(o')  = y . By  the  construction  of  ({>  , 

?(y)  £ fO;  f(’»)  = y;  , so  that  if  y e B'  n C then 

y c B'  f(0)  ,C}.  Referring  back  to  (5.4),  for  any  C . 8^ 


the  given  ineq'iality  holds; 


I 

! 

1 
t 

t 


M { C ) Q { B ' 0 C ) . 

Suppose  that  Q(C)  = 1 , then  clearly 

p(C)  e . 

But  e > 0 so  that  u cannot  be  orthogonal  to  Q . Since  i.  is  an 
integral  average  of  measures  in  , we  have  shown  that  the  families 

P-j  and  P^  are  not  weakl>  orthogonal.  □ 

Propositions  5.1  and  5.2  lead  directly  to  the  desired  fact. 

Theorem  5.2.  Ijf 

( 1 ) Assumption  M holds  , 

(2)  P.|  ; fQ},  P^  consists  only  of  discrete  measures , 

then  if  P.|  and  P^  are  weakl y orthogonal,  they  are  also  strongly 
orthogonal . 

Proof:  Proposition  5.2  gives  that  if  P.|  and  weakly 

orthogonal  then  QfP^  atoms]  = 0 . Proposition  5.1  asserts  that  P.| 
and  must  then  be  strc)ngly  orthogonal.  □ 

This  section  is  now  complete.  In  the  next  section,  all  preceding 
results  are  combined  into  one  theorem. 

Section  6.  Mixed  Measures  with  a Dominated  Continuous  Part 

In  Section  3 all  measures  considered  were  purely  discrete.  In 


Section  4 all  measures  were  assumed  to  be  dominated  by  a single  sigma 
finite  measure.  The  last  section  dealt  with  the  case  of  a single 
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continuous  measure  versus  a class  of  purely  discrete  measures.  Now  we 
wish  to  combine  all  the  above  cases  into  one. 

Every  probability  measure  can  be  uniquely  decomposed  into  the  sum 
of  two  sub-probability  measures,  one  of  which  is  discrete  and  the  other 
purely  continuous.  We  then  have  that  for  each  e,  P_  = Pq  + Pq  • 

The  measure  P^  is  purely  discrete  (atomic)  and  the  measure  Pg  is 
purely  continuous. 

What  is  to  be  assumed  in  the  rest  of  this  section  is  there  exists 
a sigma-finite  measure  X such  that  for  all  0,  Pf  « A . 

D 

The  following  notation  will  be  useful.  Define 

= {P^:  0e  0^}  , = {Pg;  e G^} 

P^  = (Pg-.  0e  0^1  , = {Pg-.  ■>£  02^  • 

The  orthogonality  of  P^  and  P^  is  equivalent  to  the  orthogonality  of 

P^  with  P^  , with  P^  , P^  with  P^  , and  P^  with  pf  . 

Each  of  these  cases  will  be  considered  one  at  a time  starting  with  P^ 
versus  P^  . 

The  lemma  oelow  is  derived  in  Section  3. 

Lemma  6.1.  if.  = {atoms  of  measures  in  P^ } and 

= {atoms  of  measures  in  P^}  then  and  A^  are  disjoint 
analytic  sets. 

Proof:  Refer  to  the  proof  of  Theorem  3.1.  d 

The  next  case  to  be  dealt  with  is  that  of  P^  versus  P^  . By 
I ermia  4.1,  the  classe  . of  P^  and  P^  each  has  a countable  equivalent 
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subset.  (Whether  the  measures  involved  are  probability  m- asure'  or 
subprobability  measures  does  not  matter.)  Enumerate  the  (.ountaMe 


equivalent  subsets  (ces)  of  and  Pp  as  ces(P?)  = (P^ 

^ I (j  ■! 


I ^2 

and  ces(Pp)  = {P^  ,P^  } . 

We  are  now  ready  to  prove  the  major  result  of  this  section. 

Theorem  6.1.  If 


( 1 ) Assumption  M holds , 

( 2 ) there  exists  a sigma-finite  measure 

0 , ^0  ^ > 


sucti  that  for  all 


Uien_jJ[  P.|  ajTd  Pp  are  weakly  orthogonal,  they  are  al so  strongly 
orthogonal . 


Proof: 


Let  P 


P^  l(w)P, 


0 c ces(P^)  and  let  t ces(Pp)  . Since 

. . _ r 


2 » it  follows  directly  that  _i  P ^ . There  is  a Bore! 

set  B.j  for  which  ^ 


(B^.)  = 0 and  P^  (B..)  = 0 
®i  'j 


In  order  to  get  the  strong  separation  of  P.^  and  Pp  it  is  desirable 
that  B-jj-  A.J  and  ^ C Ap  , for  the  set  of  atonis  of  measures 
in  P.J  and  Ap  the  set  of  atoms  of  measures  in  Pp  . This  may  not 
be  so  for  arbitrary  B^^  . 

The  results  of  Section  5 lead  to  the  fact  that  for  all  e c 0^  , 

P0(Ap)  = 0 and  for  all  0 0p,  ^0(^1)  = 0 • Because  0^  * ).|  , 

Pq  (Ap)  = 0 , so  there  is  a Borel  set  ~)  Ap  for  which  P0,(C.j)  = 0 

Because  r.  s G-  , P (A, ) = 0 , so  there  is  a Borel  sot  D.  D A,  for 
J i j I J I 

which  P^  (Dj)  = 0 . The  set  E^j  = has  th?  properties 


I 


J 
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’.hat  E.j  3A^  , E.,  , and  Py.(E.j)  = 1 and  P^.(E.j)  = 0 . 

As  in  the  proof  of  Lemma  4.2,  let  E = E.jj  . The  set  E 

contains  A,,  is  contained  in  A^,  and  for  all  i,  j,  P„  (E)  = 1 

1 2 e. 

and  Pg  (E)  = 0 . Since  E contains  Pg(E^)  = 0 for  all  0 e 0.|  . 

The  set  E is  contained  in  A^  so  Pq(E)  = 0 for  all  0 e ©2  • We 
also  have  Pa(E^)  =0  if  0 is  a 0.  but  the  definition  of  countable 

U 1 

equivalent  subset  forces  Pg(E  ) = 0 for  al  1 0 f 0.|  . Similarly 

P^(E)  = 0 for  al 1 t e ©2  . 

What  has  been  shown  is  that  P0(E)  = 1 for  all  0 c 0-|  and 
Pg(E)  = 0 for  all  ) e • The  families  P.|  and  P2  separated 
strongly  if  they  are  separated  weakly.  □ 

The  results  of  this  section  can  be  extended  somewhat.  Following 
is  an  extension  that  lives  an  answer  in  some  cases  not  covered  so 

•"ar,  particularly  th  t of  certain  undominated  families  of  continuous  measures. 

Section  7.  Discrete-Like  Undominated  Tamil  i_es  of_  Measures 

Consider  the  fanily  of  coin  tossing  measures  mentioned  in  Section  1. 

It  was  stated  there  ’hat  if  P-|  and  P,^  consist  of  such  measures, 

then  weak  and  strong  orthogonality  are  equivalent.  The  coin  tossing 
family  is  an  example  of  a class  of  families  of  measures,  that  we  will 
call  Discrete-like  Undominated  Families  (DLUF  for  short). 

First  is  a defi  ition  of  DLUF's,  followed  by  the  proof  of  the  fact 
that  weak  and  strong  separation  are  equivalent  for  DLUF's.  At  the 
end  of  this  section,  we  will  prove  the  easy  fact  that  the  coin-tossing 
family  is  a DLUF. 

The  famil/  of  mrasures  S = fQ^}  for  ycF  = [0,1]  is  said  to 
be  a Discrete-  ike  Ur  dominated  Family  if  there  exists  a Borel  function 
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f;  X ► r that  partition  X into  disjoint  Borel  set'  in  a leasurabie 
way.  By  this  is  meant  that  f is  Borel  (with  respect  to  the  usual 
Borel  sigma-fields  on  X and  !'),  and  Q (f'^y))  = 1 and 

Q (f  '(y'J)  = 0 if  y ^ y'  . 

Y 

Suppose  now  that  S = fQ  } is  some  DLUF  and  P S . Let 

i 

g:  0 r be  the  function  that  identifies  each  9 with  its  corresponding 
Y-atom,  that  is,  for  each  9:0,  ^ ^q(e)  ‘ ‘'unction  g(-) 

is  analogous  to  the  function  a(-)  of  Section  2. 

Lemma  7.1.  I f S i_^  i DLUF,  PCS  and  Assumption  M holds, 
t_h^n  _^he__fujTC_ti^n  g:  0 I as  d_ef ined  abov_e_  i_s_  a_  Bo£e]  function. 


Proojf;  This  lemma  i>  a generalization  of  Lemma  2 1,  it<  proof  is 


almost  identical.  Let  B be  a Borel  subset  of  : , tl en 
{0:  g{9)  t B}C{0:  Pp(f"^(B))  = 1}  , since  g(e)  c B mplies 
j(f"^  (B) ) = 1 by  the  identification  of  f;  but  P^  = *^q(9) 
that  Pg(r^(B))  = 1 . The  relation  (6;  g(0)  c bIDTP;  Py(r^B))  = 1} 


holds,  because  if  Pg(f'^(b))  = 1 then  ^ (f' ' (B) ) = 1 forcing 

g(0)  e B due  to  the  "partitioning”  nature  of  the  functicn  f . Since 


-1 


{e:  P (f'\B))  = 
0 

function  of  6 . 


1 ’■ 


is  a Borel  set  by  Assumption  M, 


is  a Borel 

c 


Using  the  analytic  set  separation  theorem  as  in  Sec* ion  2 we  get 

Proposition  7.1.  If  PCS,  S some  DLUF  then  _und^ 

Assumption  M,  the^ pairwise  separation  of  the  families  and  P2 

impl ies  the  strong  separation_ of  P^  and  P^  . 

Proof:  Lemma  7.1  together  with  pairwise  separatiori  gives  us  that 

the  sets  A^  = g(0.|)  and  A^  = g(  disjoint  analytic  subsets 

of  1’  . By  the  analytic  • “paration  theorem,  there  exis  s B , a 


r 


1 
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5orel  subset  of  ' wnich  contains  A-j  and  whose  lomplement  contains 
A2  . The  set  C = f'(B)  is  clearly  a Borel  subset  of  X such  that 
Pp(C)  = 1 for  e c >1  and  Pq(C)  = 0 for  0 : ©2  • □ 


The  family  S is  similar  to  a collection  of  one-point  measures 
'^xcept  that  Q,  is  not  point  mass  at  y but  rather  like  "point-mass" 
it  the  "point"  f'^ ( . ) . The  generalization  to  measures  that  are  con- 
vex combinations  of  measures  in  S leads  to  a notation  of  discrete  measures 
/nose  "atoms"  are  of  he  form  f~^(>)  . 


Define  y = > i.Q  : x.  > 0 , ) ri.  - 1 , y . f r and  Q f S}  , 

1 1 - '■  1 1 Y 

set  of  all  countably  infinite  convex  combinations  of  elements  of  S . 

We  want  to  show  that  if  PcS*  then  weak  orthogonality  of  P-j  with 
P2  implies  sti^ong  0(  thogonal  ity.  This  is  accomplished  by  proceeding 
just  as  in  Section  3 


Let 


Pc  S 


We  wi  1 1 define  g , : 0 ■ 
n ,k 


similarly  to  k ' 


Let  a"  = (yc  ■ = [0.1]:  > - }■  If  A^  has  at  least  k 

0 p n 

elements  then  define  g . (O)  to  be  the  k^*^  largest  element  of  A^  . 

n , K 

If  a'^  does  not  hav  ■ at  least  k elements  then  define  g (0)  to 
0 n , k 

be  minus  one,  a special  element.  By  examining  the  proof  of  Lemma  3.1, 
it  is  easily  seen  fo  ■ n = 1,2,...  and  k = l,...,n  that  the  function 
g^  l^(-)  is  measurab  e if  Assumption  M holds. 


Proposition  7.2  [f  S is  a DLUF,  P c S*  , and  Assumption  M 


holds,  then 


P]  1 pr)P2  impl  ies  P-|  ^ (s)p2  . 


Proof;  As  stited  above,  the  function  g . is  a Borel  function 
- - — — n 5 K 

for  each  n,  I'  . ft  sets  A = u , g (0  )\{-l  and 

1 n,k  ^n,k'  1 ' 


and 
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^2  ~ k ^ disjoint  analytic  subsets  )t 

The  sets  and  consist  of  the  -,-atoms  for  the  i easures  in  , 
and  respectively.  There  is  a Borel  set  B such  tl  it  E A^ 

and  B n A^  = 0 . If  C = t~^(B)  then  as  before  C wi  1 separate  the 
families  P.|  and  . [J 

If  P consists  of  measures  of  the  form  P^  = Q j {&/)  and 
the  measures  are  dominated  by  some  A , then  paiiwise  separation 

gives  strong  separation  of  P|  and  • 

The  following  theorem  is  an  obvious  general  i za tion  t Tnet ''e.n  6.1. 

★ 

Theorem  7.1  . Let  -S  oe  sonie  DLUF  and  5 yie_^e'  pjy^ll_ 
cj) u n tabl y _i n ri nite  ^oji vex  c o ijib i n a t i o ns  of  members  of  S . Let 

( I ) Assumption  M hold , 

(2)  P = a(o)  /q  V (dy)  + {1- for  * 1 , 

u I u y 

★ ★ 

^ ajid  s , 

;^hen^f  P^  ^ny  are  weakly  orthogonal  , they  ar^  also  strongly 

orthogonal . 

Proof:  The  only  case  that  has  not  been  resolved  it  this  section 

that  was  considered  before  is  of  P.j  s ^^v(d>)}  and  P2  '7  S*  . 

But  this  is  like  the  case  of  P^  {v}  and  P^  a set  of  discrete 
measures.  As  before  the  set  of  Y-atoms  of  measures  is  an  analytic 

set  for  which  v{atoms}  = 0 . There  is  a Borel  set  B in  r that 
contains  the  y atoms  and  v(B)  = 0 . The  set  f^(B)  will  split 
the  single  continuous  measure  from  the  discrete  ones.  Ni  w go  on  as  before 
to  get  the  resul t.  □ 

Two  examples  of  DLUF''i  are  the  atomic  masses  and  th>  coin  tossing 


measures. 
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1 

j 


If  5={d  :yl  0,1]}  then  S is  a DLUF  sirice  f:  X ->•  r for 

f(x)  = X satisfies  the  requirements  stated  above.  It  follows  then 
that  all  the  results  of  Sections  2 through  6 are  special  cases  of 
the  Theorem  7.1. 

If  S - {Q^:  Yf  [0,1]}  for  the  coin  tossing  measure  with 

probability  y then  S is  a DLUF.  The  measure  is  the  distri- 

on  ^ 

bution  of  Y = } t:  where  the  X^,  are  i.i.d.  Bernoulli  random 

^ k=l  2*^ 

variables  with  parameter  y . Express  x e X = [0,1]  in  terms  of 

k 

Its  binary  expansion  x = bj^/2  and  if  x has  two  expansions, 

use  the  one  with  non-terminating  ones.  The  function 
b.|  . +b^ 

f(x)  = lim  sup^_,^(— - -j:| — - ) is  a Borel  function,  being  the  countable 
lim  sup  of  Borel  functions.  By  the  strong  law  of  large  numbers, 
Q-^.(f'\T))  = 1 and  Q.^,}f'\>'))  =0  for  r Y ' • The  family  S is 
then  a Discrete-Like  Undominated  Family. 

1 • 

I Section  8^ Concl  u^sjcm 

in  this  paper,  it  has  been  shown  that  for  families  usually  consi- 
dered in  the  study  of  hypothesis  testing,  weak  orthogonality  implies 
strong  orthogonal ity . For  discrete  families,  dominated  continuous 
families,  and  for  mi/tures  of  discrete  measures  with  dominated  con- 
tinuous measures,  thf*  two  notions  of  orthogonality  are  equivalent. 

At  this  time,  tue  author  does  not  know  of  any  counterexample. 

Nor  does  he  know  of  my  general  proof  of  the  equivalence  of  weak  and 
strong  separation,  i'  there  is  such  a proof. 
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